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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Robert G. Ulrichy 



(ii) TITLE OF INVENTION: Bacterial Superantigen 
Vaccines 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Elizabeth Arwine 

(B) STREET: US Army MRMC -504 Scott Street 

MCMR-JA (Elizabeth Arwine-Patent 

Atty) 

(C) CITY: FORT DETRICK 

(D) STATE: MARYLAND 
ij^ (E) COUNTRY: USA 

(F) ZIP: 21702-5012 

S (v) COMPUTER READABLE FORM: 

if, (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Apple Macintosh 
l"^ (C) OPERATING SYSTEM: Macintosh 7.5 

?y (D) SOFTWARE: Microsoft Word 6.0 

»^ 

3 (vi) CURRENT APPLICATION DATA: 

M (A) APPLICATION NUMBER: 

(B) FILING DATE: 
rU (C) CLASSIFICATION: 

m 

Q (vii) PRIOR APPLICATION DATA: 

|A (A) APPLICATION NUMBER: 08/882,431 

(B) FILING DATE: June 25, 1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Sana A. Pratt 

(B) REGISTRATION NUMBER: 39,441 

(C) REFERENCE /DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: (301) 619-2065 

(B) TELEFAX: (301) 619-7714 



(2) INFORMATION FOR SEQUENCE ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 



(D) TOPOLOGY: Unknown 
(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 
ATGAAAAAAA CAGCATTTAC ATTACTTTTA TTCATTGCCC 
TAACGTTGAC AACAAGTCCA CTTGTAAATG GTAGCGAGAA 
AAGCGAAGAA ATAAATGAAA AAGATTTGCG AAAAAAGTCT 
GAATTGCAGG GAACAGCTTT AGGCAATCTT AAACAAATCT 
ATTATTACAA TGAAAAAGCT AAAACTGAAA ATAAAGAGAG 
TCACGATCAA TTTCGACAGC ATACTATATT GTTTAAAGGC 
,p^TTTTACAG ATCATTCGTG GTATAACGAT TTATTAGTAC 
GTTTTGATTC AAAGGATATT GTTGATAAAT ATAAAGGGAA 
AAAAGTAGAC TTGTATGGTG CTTATGCTGG TTATCAATGT 
GCGGGTGGTA CACCAAACAA AACAGCTTGT ATGTATGGTG 
GTGTAACGTT ACATGATAAT AATCGATTGA CCGAAGAGAA 
AT^GTGCCG ATCAATTTAT GGCTAGACGG TAAACAAAAT 
ACAGTACCTT TGGAAACGGT TAAAACGAAT AAGAAAAATG 
TAACTGTTCA GGAGTTGGAT CTTCAAGCAA GACGTTATTT 
ACAGGAAAAA TATAATTTAT ATAACTCTGA TGTTTTTGAT 
GGGAAGGTTC AGAGGGGATT AATCGTGTTT CATACTTCTA 
CAGAACCTTC GGTTAATTAC GATTTATTTG GTGCTCAAGG 
ACAGTATTCA AATACACTAT TAAGAATATA TAGAGATAAT 
AAAACGATTA ACTCTGAAAA CATGCATATT GATATATATT 
TATATACAAG TTAAACATGG TAGTTTTGAC CAACGTAATG 
TTCAGATTAT TATGAACCGA GAATAATCTA 

INFORMATION FOR SEQUENCE ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 257 
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(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Lys Thr Ala Phe Thr Leu Leu Leu 
1 5 10 

Phe lie Ala Leu Thr Leu Thr Thr Ser Pro 
15 20 

Leu Val Asn Gly Ser Glu Lys Ser Glu Glu 
25 30 

He Asn Glu Lys Asp Leu Arg Lys Lys Ser 

35 40 

Glu Leu Gin Gly Thr Ala Leu Gly Asn Leu 
45 50 

C3 Lys Gin He Tyr Tyr Tyr Asn Glu Lys Ala 

□ 55 60 

o 

ril Lys Thr Glu Asn Lys Glu Ser His Asp Gin 

^5 65 70 

Phe Arg Gin His Thr He Leu Phe Lys Gly 

T 75 80 

Phe Phe Thr Asp His Ser Trp Tyr Asn Asp 

S '° 

IL! Leu Leu Val Arg Phe Asp Ser Lys Asp He 

O 95 100 

Val Asp Lys Tyr Lys Gly Lys Lys Val Asp 
105 110 

Leu Tyr Gly Ala Tyr Ala Gly Tyr Gin Cys 
115 120 

Ala Gly Gly Thr Pro Asn Lys Thr Ala Cys 
125 130 

Met Tyr Gly Gly Val Thr Leu His Asp Asn 
135 140 

Asn Arg Leu Thr Glu Glu Lys Lys Val Pro 
145 150 

He Asn Leu Trp Leu Asp Gly Lys Gin Asn 
155 160 
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Thr Val Pro Leu Glu Thr Val Lys Thr Asn 
165 170 

Lvs Lys Asn Val Thr Val Gin Glu Leu Asp 

^ 175 180 

Leu Gin Ala Arg Arg Tyr Leu Gin Glu Lys 
185 190 

Tvr Asn Leu Tyr Asn Ser Asp Val Phe Asp 

^ 195 200 

Glv Lys Val Gin Arg Gly Leu He Val Phe 
205 210 

His Thr Ser Thr Glu Pro Ser Val Asn Tyr 
215 220 

Asp Leu Phe Gly Ala Gin Gly Gin Tyr Ser 
1^ 225 230 

S Asn Thr Leu leu Arg He Tyr Arg Asp Asn 

i 

^ Lys Thr He Asn Ser Glu Asn Met His He 
"•4 245 250 

C8 

^ Asp He Tyr Leu Tyr Thr Ser 
a 255 

ru 

m 

□ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Untaown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGAGAAAAG CGAAGAAATA AATGAAAAAG ATTTGCGAAA 40 
AAAGTCTGAA TTGCAGGGAA CAGCTTTAGG CAATCTTAAA 80 
CAAATCTATT ATTACAATGA AAAAGCTAAA ACTGAAAATA 120 
AAGAGAGTCA CGATCAATTT CGACAGCATA CTATATTGTT 160 
TAAAGGCTTT TTTACAGATC ATTCGTGGTA TAACGATTTA 200 



(4) INFORMATION FOR SEQUENCE ID NO: 3: 
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TTAGTACGTT TTGATTCAAA GGATATTGTT GATAAATATA 240 

AAGGGAAAAA AGTAGACTTG TATGGTGCTT ATGCTGGTTA 280 

TCAATGTGCG GGTGGTACAC CAAACAAAAC AGCTTGTATG 320 

TATGGTGGTG TAACGTTACA TGATAATAAT CGATTGACCG 360 

AAGAGAAAAA AGTGCCGATC AATTTATGGC TAGACGGTAA 400 

ACAAAATACA GTACCTTTGG AAACGGTTAA AACGAATAAG 440 

AAAAATGTAA CTGTTCAGGA GTTGGATCTT CAAGCAAGAC 480 

GTTATTTACA GGAAAAATAT AATTTATATA ACTCTGATGT 520 

TTTTGATGGG AAGGTTCAGA GGGGATTAAT CGTGTTTCAT 560 

ACTTCTACAG AACCTTCGGT TAATTACGAT TTATTTGGTG 600 

1,^ CTCAAGGACA GTATTCAAAT ACACTATTAA GAATATATAG 640 

Q AGATAATAAA ACGATTAACT CTGAAAACAT GCATATTGAT 680 

j^y ATATATTTAT ATACAAGTTA AACATGGTAG TTTTGACCAA 720 

^^ CGTAATGTTC AGATTATTAT GAACCGAGAA TAATCTA 757 

9 

M (5) INFORMATION FOR SEQUENCE ID NO: 4: 

It (i) SEQUENCE CHARACTERISTICS: 

ru (A) LENGTH: 233 

Cn (B) TYPE: Amino Acid 

C3 (C) STRANDEDNESS : Unknown 

M (D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Lys Ser Glu Glu lie Asn Glu Lys 
5 10 

Asp Leu Arg Lys Lys Ser Glu Leu Gin Gly 
15 20 

Thr Ala Leu Gly Asn Leu Lys Gin lie Tyr 
25 30 

Tvr Tyr Asn Glu Lys Ala Lys Thr Glu Asn 
35 40 

Lys Glu Ser His Asp Gin Phe Arg Gin His 
45 50 
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Thr lie Leu Phe Lys Gly Phe Phe Thr Asp 
55 60 

His Ser Trp Tyr Asn Asp Leu Leu Val Arg 

65 70 

Phe Asp Ser Lys Asp He Val Asp Lys Tyr 
75 BO 

Lys Gly Lys Lys Val Asp Leu Tyr Gly Ala 
85 90 

Tyr Ala Gly Tyr Gin Cys Ala Gly Gly Thr 
95 100 

Pro Asn Lys Thr Ala Cys Met Tyr Gly Gly 
105 110 



It Val Thr Leu His Asp Asn Asn Arg Leu Thr 

115 120 

Glu Glu Lys Lys Val Pro He Asn Leu Trp 
'•^ 125 130 

w 

Leu Asp Gly Lys Gin Asn Thr Val Pro Leu 
s 135 140 

Glu Thr Val Lys Thr Asn Lys Lys Asn Val 
rU 145 150 

m 

Q Thr Val Gin Glu Leu Asp Leu Gin Ala Arg 

M 155 160 

Arg Tyr Leu Gin Glu Lys Tyr Asn Leu Tyr 
165 170 

Asn Ser Asp Val Phe Asp Gly Lys Val Gin 

175 180 

Arg Gly Leu He Val Phe His Thr Ser Thr 
185 190 

Glu Pro Ser Val Asn Tyr Asp Leu Phe Gly 
195 200 

Ala Gin Gly Gin Tyr Ser Asn Thr Leu Leu 
205 210 

Arg He Tyr Arg Asp Asn Lys Thr He Asn 
215 220 
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Ser Glu Asn Met His He Asp He Tyr Leu 
225 230 



Tyr Thr Ser 



(6) INFORMATION FOR SEQUENCE ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GAACTAGGTA GAAAAATAAT TATGAGAAAA CACTATGTTG 40 
Q TTAAAGATGT TTTCGTATAT AAGTTTAGGT GATGTATAGT 80 

Q 

p TACTTAATTT TAAAAGCATA ACTTAATTAA TATAAATAAC 120 

^^j ATGAGATTAT TAAATATAAT TAAGTTTCTT TTAATGTTTT 160 

£0 

g TTTAATTGAA TATTTAAGAT TATAACATAT ATTTAAAGTG 200 



u 

fu 



TATCTAGATA CTTTTTGGGA ATGTTGGATA AAGGAGATAA 240 

AAAATGTATA AGAGATTATT TATTTCACAT GTAATTTTGA 280 

*n TATTCGCACT GATATTAGTT ATTTCTACAC CCAACGTTTT 320 

i"^ AGCAGAGAGT CAACCAGATC CTAAACCAGA TGAGTTGCAC 360 

AAATCGAGTA AATTCACTGG TTTGATGGAA GATATGAAAG 400 

TTTTGTATGA TGATAATCAT GTATCAGCAA TAAACGTTAA 440 

ATCTATAGAT CAATTTCTAT ACTTTGACTT AATATATTCT 480 

ATTAAGGACA CTAAGTTAGG GGATTATGAT AATGTTCGAG 520 

TCGAATTTAA AAACAAAGAT TTAGCTGATA AATACAAAGA 560 

TAAATACGTA GATGTGTTTG GAGCTAATTA TTATTATCAA 600 

TGTTATTTTT CTAAAAAAAC GAATGATATT AATTCGCATC 640 



AAACTGACAA ACGAAAAACT TGTATGTATG GTGGTGTAAC 



680 
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TGAGCATAAT GGAAACCAAT TAGATAAATA TAGAAGTATT 720 

ACTGTTCGGG TATTTGAAGA TGGTAAAAAT TTATTATCTT 760 

TTGACGTACA AACTAATAAG AAAAAGGTGA CTGCTCAAGA 800 

ATTAGATTAC CTAACTCGTC ACTATTTGGT GAAAAATAAA 840 

AAACTCTATG AATTTAACAA CTCGCCTTAT GAAACGGGAT 880 

ATATTAAATT TATAGAAAAT GAGAATAGCT TTTGGTATGA 920 

CATGATGCCT GCACCAGGAG ATAAATTTGC CCAATCTAAA 960 

TATTTAATGA TGTACAATGA CAATAAAATG GTTGATTCTA 1000 

AAGATGTGAA GATTGAAGTT TATCTTACGA CAAAGAAAAA 1040 

GTGAAATTAT ATTTTAGAAA AGTAAATATG AAGAGTTAGT 1080 

AATTAAGGCA GGCACTTATA GAGTACCTGC CTTTTCTAAT 1120 

j3 ATTATTTAGT TATAGTTATT TTTGTTATAT CTCTCTGATT 1160 

^2 TAGCATTAAC CCCTTGTTGC CATTATAGTT TTCACCAACT 1200 

TTAGCTGAAA TTGGGGGATC ATTTTTATCT TTACTATGGA 1240 

=P TAGTTACTGT GTCGCCGTTT TTAACGATTT GTTTCTCTTT 1280 

a 

H TAATTTGTCA GTTAATTTTT TCCATGCATC ATTTGCGTCA 1320 

!*= 

fy AACCTATTTC CATTTGGATT TATTCTTGAC AAATCAATTC 1360 

Q TTTTAACACT ATCGGTATTA ATCGGCTTGT TATTAAAATT 1400 

ACTAAGTTCA TCTAAATCAG CTGTACCCGT AATACTACTT 1440 

TCGCCACCAT TATTTAAATT GTACGTAACA CCAACTGTCT 1480 

CATTTGCTGT TTTATCGATA ATATTTGCTT CTTTCAAAGC 1520 

ATCTCTTACA TTTTTCCATA AGTCTCTATC TGTTATTTCA 1560 

GAAGCCTTTG CAACGTTATT AATACCATTA TAATTTGAAG 1600 

AAGAATGAAA ACCTGAACCT ACTGTTGTTA AAACTAAAGC 1640 

ACTTGCTATC AATGTTCTTG TTAATAGTTT TTTATTCATT 1680 

TTATTTTCTC CTATAACTTA TTTGCAATCG AT 1712 
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(7) INFORMATION FOR SEQUENCE ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 

(B) TYPE: Amino Acid 

( C ) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Tyr Lys Arg Leu Phe lie Ser His Val 
5 10 

lie Leu He Phe Ala Leu He Leu Val He 
15 20 

Ser Thr Pro Asn Val Leu Ala Glu Ser Gin 

25 30 

U 

□ Pro Asp Pro Lys Pro Asp Glu Leu His Lys 

C3 35 40 

rU Ser Ser Lys Phe Thr Gly Leu Met Glu Asp 

--^4 45 50 

M 

p Met Lys Val Leu Tyr Asp Asp Asn His Val 

55 60 

11 Ser Ala He Asn Val Lys Ser He Asp Gin 

ru " 

Phe Leu Tyr Phe Asp Leu He Tyr Ser He 
p 75 80 

Lys Asp Thr Lys Leu Gly Asp Tyr Asp Asn 
85 90 

Val Arg Val Glu Phe Lys Asn Lys Asp Leu 
95 100 

Ala Asp Lys Tyr Lys Asp Lys Tyr Val Asp 
105 110 

Val Phe Gly Ala Asn Tyr Tyr Tyr Gin Cys 

115 120 

Tyr Phe Ser Lys Lys Thr Asn Asp He Asn 
125 130 

Ser His Gin Thr Asp Lys Arg Lys Thr Cys 
135 140 
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Met Tyr Gly Gly Val Thr Glu His Asn Gly 

145 150 

Asn Gin Leu Asp Lys Tyr Arg Ser He Thr 
155 160 

Val Arg Val Phe Glu Asp Gly Lys Asn Leu 

165 170 

Leu Ser Phe Asp Val Gin Thr Asn Lys Lys 
175 180 

Lys Val Thr Ala Gin Glu Leu Asp Tyr Leu 

185 190 

Thr Arg His Tyr Leu Val Lys Asn Lys Lys 
195 200 

Leu Tyr Glu Phe Asn Asn Ser Pro Tyr Glu 
205 210 

Thr Glv Tyr He Lys Phe He Glu Asn Glu 
□ 215 220 

9 

O Asn Ser Phe Trp Tyr Asp Met Met Pro Ala 

fU 225 230 

CO Pro Gly Asp Lys Phe Ala Gin Ser Lys Tyr 

»P 235 240 

|i Leu Met Met Tyr Asn Asp Asn Lys Met Val 

M 245 250 



Asp Ser Lys Asp Val Lys He Glu Val Tyr 
255 260 

Leu Thr Thr Lys Lys Lys 
265 



(8) INFORMATION FOR SEQUENCE ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 



(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAACTAGGTA GAAAAATAAT TATGAGAAAA CACTATGTTG 
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TTAAAGATGT TTTCGTATAT AAGTTTAGGT GATGTATAGT 80 

TACTTAATTT TAAAAGCATA ACTTAATTAA TATAAATAAC 120 

ATGAGATTAT TAAATATAAT TAAGTTTCTT TTAATGTTTT 160 

TTTAATTGAA TATTTAAGAT TATAACATAT ATTTAAAGTG 200 

TATCTAGATA CTTTTTGGGA ATGTTGGATA AAGGAGATAA 240 

AAAATGTATA AGAGATTATT TATTTCACAT GTAATTTTGA 280 

TATTCGCACT GATATTAGTT ATTTCTACAC CCAACGTTTT 320 

AGCAGAGAGT CAACCAGATC CTAAACCAGA TGAGTTGCAC 360 

AAATCGAGTA AATTCACTGG TTTGATGGAA AATATGAAAG 400 

TTTTGTATGA TGATAATCAT GTATCAGCAA TAAACGTTAA 440 

Q ATCTATAGAT CAATTTCGAT ACTTTGACTT AATATATTCT 480 

□ 

p ATTAAGGACA CTAAGTTAGG GAATTATGAT AATGTTCGAG 520 

TCGAATTTAA AAACAAAGAT TTAGCTGATA AATACAAAGA 560 

TAAATACGTA GATGTGTTTG GAGCTAATGC TTATTATCAA 600 

TGTGCTTTTT CTAAAAAAAC GAATGATATT AATTCGCATC 640 

M AAACTGACAA ACGTUU^CT TGTATGTATG GTGGTGTAAC 680 

Cn TGAGCATAAT GGAAACCAAT TAGATAAATA TAGAAGTATT 720 

M ACTGTTCGGG TATTTGAAGA TGGTAAAAAT TTATTATCTT 760 

TTGACGTACA AACTAATAAG AAAAAGGTGA CTGCTCAAGA 800 

ATTAGATTAC CTAACTCGTC ACTATTTGGT GAAAAATAAA 840 
AAACTCTATG AATTTAACAA CTCGCCTTAT GAAACGGGAT 880 
ATATTAAATT TATAGAAAAT GAGAATAGCT TTTGGTATGA 920 
CATGATGCCT GCACCAGGAG ATAAATTTGA CCAATCTAAA 960 

TATTTAATGA TGTACAATGA CAATAAAATG GTTGATTCTA 1000 

AAGATGTGAA GATTGAAGTT TATCTTACGA CAAAGAAAAA 1040 

GTGAAATTAT ATTTTAGAAA AGTAAATATG AAGAGTTAGT 1080 

AATTAAGGCA GGCACTTATA GAGTACCTGC CTTTTCTAAT 1120 
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ATTATTTAGT TATAGTTATT TTTGTTATAT CTCTCTGATT 1160 

TAGCATTAAC CCCTTGTTGC CATTATAGTT TTCACCAACT 1200 

TTAGCTGAAA TTGGGGGATC ATTTTTATCT TTACTATGGA 1240 

TAGTTACTGT GTCGCCGTTT TTAACGATTT GTTTCTCTTT 1280 

TAATTTGTCA GTTAATTTTT TCCATGCATC ATTTGCGTCA 1320 

AACCTATTTC CATTTGGATT TATTCTTGAC AAATCAATTC 13 60 

TTTTAACACT ATCGGTATTA ATCGGCTTGT TATTAAAATT 1400 

ACTAAGTTCA TCTAAATCAG CTGTACCCGT AATACTACTT 1440 

TCGCCACCAT TATTTAAATT GTACGTAACA CCAACTGTCT 1480 

CATTTGCTGT TTTATCGATA ATATTTGCTT CTTTCAAAGC 1520 

ATCTCTTACA TTTTTCCATA AGTCTCTATC TGTTATTTCA 1560 

GAAGCCTTTG CAACGTTATT AATACCATTA TAATTTGAAG 1600 

AAGAATGAAA ACCTGAACCT ACTGTTGTTA AAACTAAAGC 1640 

ACTTGCTATC AATGTTCTTG TTAATAGTTT TTTATTCATT 1680 

TTATTTTCTC CTAJAACTTA TTTGCAATCG AT 1712 



INFORMATION FOR SEQUENCE ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Tyr Lys Arg Leu Phe lie Ser His Val 
5 10 

He Leu He Phe Ala Leu He Leu Val He 
15 20 

Ser Thr Pro Asn Val Leu Ala Glu Ser Gin 
25 30 
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Pro Asp Pro Lys Pro Asp Glu Leu His Lys 
35 40 

Ser Ser Lys Phe Thr Gly Leu Met Glu Asn 

45 50 

Met Lys Val Leu Tyr Asp Asp Asn His Val 
55 60 

Ser Ala lie Asn Val Lys Ser lie Asp Gin 
65 70 

Phe Arg Tyr Phe Asp Leu He Tyr Ser He 
75 80 

Lys Asp Thr Lys Leu Gly Asn Tyr Asp Asn 
85 90 

Val Arg Val Glu Phe Lys Asn Lys Asp Leu 
95 100 

2 Ala Asp Lys Tyr Lys Asp Lys Tyr Val Asp 

□ 105 110 

ru 

Val Phe Gly Ala Asn Ala Tyr Tyr Gin Cys 
CO 115 120 

s Ala Phe Ser Lys Lys Thr Asn Asp He Asn 

1.^ 125 130 

fU Ser His Gin Thr Asp Lys Arg Lys Thr Cys 

m 135 140 

b 

lA Met Tyr Gly Gly Val Thr Glu His Asn Gly 

145 150 

Asn Gin Leu Asp Lys Tyr Arg Ser He Thr 
155 160 

Val Arg Val Phe Glu Asp Gly Lys Asn Leu 
165 170 

Leu Ser Phe Asp Val Gin Tyr Asn Lys Lys 

175 180 

Lys val Thr Ala Gin Glu Leu Asp Tyr Leu 
185 190 

Thr Arg His Tyr Leu Val Lys Asn Lys Lys 
195 200 

Leu Tyr Glu Phe Asn Asn Ser Pro Tyr Glu 
205 210 
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Thr Gly Tyr lie Lys Phe lie Glu Asn Glu 
215 220 

Asn Ser Phe Trp Tyr Asp Met Met Pro Ala 

225 230 

Pro Gly Asp Lys Phe Asp Gin Ser Lys Tyr 
235 240 

Leu Met Met Tyr Asn Asp Asn Lys Met Val 

245 250 

Asp Ser Lys Asp Val Lys He Glu Val Tyr 
255 260 

Leu Thr Thr Lys Lys Lys 
265 



1^ 

b (10) INFORMATION FOR SEQUENCE ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
S (A) LENGTH: 1388 

• . (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 
^-Q (D) TOPOLOGY: Unlmown 

L (ii) Molecule type: DNA 

M (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ru 

in ATGAGTCAAC CAGATCCTAA ACCAGATGAG TTGCACAAAT 40 

□ 

CGAGTAAATT CACTGGTTTG ATGGAAAATA TGAAAGTTTT 80 

GTATGATGAT AATCATGTAT CAGCAATAAA CGTTAAATCT 120 

ATAGATCAAT TTCGATACTT TGACTTAATA TATTCTATTA 160 

AGGACACTAA GTTAGGGAAT TATGATAATG TTCGAGTCGA 200 

ATTTAAAAAC AAAGATTTAG CTGATAAATA CAAAGATAAA 240 

TACGTAGATG TGTTTGGAGC TAATGCTTAT TATCAATGTG 280 

CTTTTTCTAA AAAAACGAAT GATATTAATT CGCATCAAAC 320 

TGACAAACGA AAAACTTGTA TGTATGGTGG TGTAACTGAG 360 

CATAATGGAA ACCAATTAGA TAAATATAGA AGTATTACTG 400 

TTCGGGTATT TGAAGATGGT AAAAATTTAT TATCTTTTGA 440 



15 



□ 

Q 

m 
m 

s 

ru 
m 





AA 1 AAVj "iArvrt. 


AGGTGACTGC 


TCAAGAATTA 


480 




U X V^Vj J. V^rtV— A rt. 


TTTGGTGAAA 


AATAAAAAAC 


520 


m/^ rn A rn/^ 7\ 7\ > i ^'1 ^ 
1 L. i A i vj A/\ i i 


i AAUAAV- 1 V-VJ 


C r TT ATGAAA 


CGGGATATAT 


560 


Ti A A A •T"T*Tl B Tf 2i 


PB a a aTPAHA 


ATAGCTTTTG 


GTATGACATG 


600 






ATTTGACCAA 


TCTAAATATT 


640 


m A TV tr\/"f A (Ti/^rp A 
TAAivjAHjiA 


p a afPAP A AT 

V, AA 1 I^jAK- AA i 


AAAATGGTTG 


ATTC TAAAGA 


680 


TGTGAAGA i 1 


pa aP'T*T"T'ATP 
(jAAVjI 1 lAlU 


TTAPr^APAAA 


GAAAAAGTGA 


720 


AATTATATTT 


i AGAAAALr i A 


AATATGAAGA 


GTTAGTAATT 


760 


AAGGCAGGCA 


/-Mill 11 A m»/^ Aj^m 
L. 1 1 A 1 Au AV3 X 


ACCTGCCTTT 


TCTAATATTA 


800 


TTTAGTTATA 


/^rnrri jv 1 1 11 1 u 1 u 1 11 1 
yjx lAl 11 1 IVs 


TTATATCTCT 


CTGATTTAGC 


840 


71 mm A A t^r^t~*f^rn 

AT TAALL.L.I-. i 


lAjl iTjl^V-Al 1 


ATAGTTTTCAC CAACTTTAG 


880 


/-ipn/^ 7v 7v A mm/^o 
CTGAAAT1\jG 


vju^A 1 A 111 


TTATCTTTAC 


TATGGATAGT 


920 


m A ^ m/^ m^ rn<^ /*• 


PPPTTTTT A A 
UL.Ial 111 lAA 


CGATTTGTTT 


CTCTTTTAAT 


960 


TTGTCAGTTA 


Al 1111 1\.^A 


TGCATCATTT 


GCGTCAAACC 


1000 


rn A mmiTV/^ A mrn 
lAl 1 iVt^Ai 1 


rpcin A T"T"P ATT* 
1\3V3A1 1 lAl ± 


CTTGACAAAT 


CAATTCTTTT 


1040 


AAV, Al> 1 A 1 V, 




GCTTGTTATT 


AAAATTACTA 


1080 


AGTTu A X A 


a a TP A nPTTiT 

AA 1 U AVsU i\3 1 


ACCCGTAATA 


CTACTTTCGC 


1120 


/*• A A mni A mm 
L.AL.Q,Ai lAl J. 


TaaaTTf^TAP 

lAAAl IVslAV. 


GTAACACCAA 


CTGTCTCATT 


1160 


m*^ m/^ mrnmm A 
TGCTGTi 1 lA 


TPPATa ATAT 
il-ljAl AAl Al 


TTGCTTCTTT 


CAAAGCATCT 


1200 


CTTACATTTT 


TPPATA A^^TP 
Ia^U A 1 AAu 1 V- 


TCTATCTGTT 


ATTTCAGAAG 


1240 


CCTTTGCAAC 


GTTATTAATA 


CCATTATAAT 


TTGAAGAAGA 


1280 


ATGAAAACCT 


GAACCTACTG 


TTGTTAAAAC 


TAAAGCACTT 


1320 


GCTATCAATG 


TTCTTGTTAA 


TAGTTTTTTA 


TTCATTTTAT 


1360 


TTTCTCCTAT 


AACTTATTTG 


CAATCGAT 




1388 



(11) INFORMATION FOR SEQUENCE ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 239 
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(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknovm 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Gin Pro Asp Pro Lys Pro Asp Glu 

5 10 

Leu His Lys Ser Ser Lys Phe Thr Gly Leu 
15 20 

Met Glu Asn Met Lys Val Leu Tyr Asp Asp 
25 30 

Asn His Val Ser Ala He Asn Val Lys Ser 
35 40 

He Asp Gin Phe Arg Tyr Phe Asp Leu He 
45 50 

Tvr Ser He Lys Asp Thr Lys Leu Gly Asn 
55 60 

Tvr Asp Asn Val Arg Val Glu Phe Lys Asn 
65 70 

Lys Asp Leu Ala Asp Lys Tyr Lys Asp Lys 
75 80 

Tvr Val Asp Val Phe Gly Ala Asn Ala Tyr 
85 90 

Tvr Gin Cys Ala Phe Ser Lys Lys Thr Asn 

95 100 

Asp He Asn Ser His Gin Thr Asp Lys Arg 
105 110 

Lvs Thr Cys Met Tyr Gly Gly Val Thr Glu 

115 120 

His Asn Gly Asn Gin Leu Asp Lys Tyr Arg 
125 130 

Ser He Thr Val Arg Val Phe Glu Asp Gly 
135 140 

Lvs Asn Leu Leu Ser Phe Asp Val Gin Thr 
145 150 

Asn Lys Lys Lys Val Thr Ala Gin Glu Leu 
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160 



His Tyr Leu Val Lys 

170 



Glu Phe Asn Asn Ser 
180 



Tyr lie Lys Phe lie 

190 



Phe Trp Tyr Asp Met 
200 

Asp Lys Phe Asp Gin 
210 



Met Tyr Asn Asp Asn 
220 

Lys Asp Val Lys lie 
230 



Thr Lys Lys Lys 



^4 
CO 

s (12) INFORMATION FOR SEQUENCE ID NO: 11: 



i"^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 
fU (B) TYPE: Nucleic Acid 

Cn (C) STRANDEDNESS : Unknown 

C3 (D) TOPOLOGY: Unknown 



(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TAAGGAGAAT TAAAAATGAA TAAAAAATTA CTAATGAATT 40 

TTTTTATCGT AAGCCCTTTG TTGCTTGCGA CAACTGCTAC 80 

AGATTTTACC CCTGTTCCCT TATCATCTAA TCAAATAATC 120 

AAAACTGCAA AAGCATCTAC AAACGATAAT ATAAAGGATT 160 

TGCTAGACTG GTATAGTAGT GGGTCTGACA CTTTTACAAA 200 

TAGTGAAGTT TTAGATAATT CCAGAGGATC TATGCGTATA 240 

AAAAACACAG ATGGCAGCAT CAGCTTGATA ATTTTTCCGA 280 



155 



Asp Tyr Leu Thr Arg 

165 



Asn Lys Lys Leu Tyr 
175 

Pro Tyr Glu Thr Gly 

185 



Glu Asn Glu Asn Ser 
195 

Met Pro Ala Pro Gly 
205 

Ser Lys Tyr Leu Met 
215 



,A Lys Met Val Asp Ser 



h Glu Val Tyr Leu Thr 

m 235 
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GTCCTTATTA 


TAGCCCTGCT 


TTTACAAAAG GGGAAAAAGT 


320 


TGACTTAAAC 


ACAAAAAGAA 


CTAAAAAAAG 


CCAACATACT 


360 


AGCGAAGGAA 


CTTATATCCA 


TTTCCAAATA 


AGTGGCGTTA 


400 


CAAATACTGA 


AAAATTACCT 


ACTCCAATAG 


AACTACCTTT 


440 


AAAAGTTAAG 


GTTCATGGTA 


AAGATAGCCC 


CTTAAAGTAT 


480 


GGGCCAAAGT 


TCGATAAAAA 


ACAATTAGCT ATATCAACTT 


520 


TAGACTTTGA 


AATTCGTCAT 


CAGCTAACTC 


AAATACATGG 


560 


ATTATATCGT 


TCAAGCGATA 


AAACGGGTGG 


TTATTGGAAA 


600 


ATAACAATGA 


ATGACGGATC 


CACATATCAA 


AGTGATTTAT 


640 


CTAAAAAGTT 


TGAATACAAT 


ACTGAAAAAC 


CACCTATAAA . 


680 


TATTGATGAA 


ATAAAAACTA 


TAGAAGCAGA 


AATTAATTAA 


720 


TTTACCACTT 


T 






731 



C3 

□ 
fU 

li (13) INFORMATION FOR SEQUENCE ID NO: 12: 

r (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 

(B) TYPE: Amino Acid 

^ (C) STRANDEDNESS : Unknown 

|.y (D) TOPOLOGY: Unknovm 

m 

£3 (ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asn Lys Lys Leu Leu Met Asn Phe Phe 

5 10 

lie Val Ser Pro Leu Leu Leu Ala Thr Thr 
15 20 

Ala Thr Asp Phe Thr Pro Val Pro Leu Ser 

25 30 

Ser Asn Gin lie lie Lys Thr Ala Lys Ala 
35 40 

Ser Thr Asn Asp Asn lie Lys Asp Leu Leu 
45 50 

Asp Trp Tyr Ser Ser Gly Ser Asp Thr Phe 
55 60 
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Thr Asn Ser Glu Val Leu Asp Asn Ser Arg 
65 70 

Gly Ser Met Arg lie Lys Asn Thr Asp Gly 

75 80 

Ser He Ser Leu He He Phe Pro Ser Pro 
85 90 

Tyr Tyr Ser Pro Ala Phe Thr Lys Gly Glu 

95 100 

Lys Val Asp Leu Asn Thr Lys Arg Thr Lys 
105 110 

Lys Ser Gin His Thr Ser Glu Gly Thr Tyr 
115 120 



Q 
□ 
P 
fU 



He His Phe Gin He Ser Gly Val Thr Asn 
125 130 

Thr Glu Lys Leu Pro Thr Pro He Glu Leu 
135 140 

Pro Leu Lys Val Lys Val His Gly Lys Asp 
145 150 



ru 
a 



Ser Pro Leu Lys Tyr Gly Pro Lys Phe Asp 
155 160 

Lys Lys Gin Leu Ala He Ser Thr Leu Asp 
165 170 

Phe Glu He Arg His Gin Leu Thr Gin He 
175 180 



His Gly Leu Tyr Arg Ser Ser Asp Lys Thr 

185 190 

Gly Gly Tyr Trp Lys He Thr Met Asn Asp 
195 200 

Gly Ser Thr Tyr Gin Ser Asp Leu Ser Lys 

205 210 

Lys Phe Glu Tyr Asn Thr Glu Lys Pro Pro 
215 220 

He Asn He Asp Glu He Lys Thr He Glu 
225 230 



Ala Glu He Asn 
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(14) INF0R14ATI0N FOR SEQUENCE ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATCATTAAAT ATAATTAATT TTCTTTTAAT ATTTTTTTAA 40 

TTGAATATTT AAGATTATAA GATATATTTA AAGTGTATCT 80 

AGATACTTTT TGGGAATGTT GGATGAAGGA GATAAAAATG 120 

AATAAGAGTC GATTTATTTC ATGCGTAATT TTGATATTCG 160 

C3 CACTTATACT AGTTCTTTTT ACACCCAACG TATTAGCAGA 200 

fU GAGCCAACCA GACCCTACGC CAGATGAGTT GCACAAAGCG 240 

fp AGTAAATTCA CTGGTTTGAT GGAAAATATG AAAGTTTTAT 280 



ru 
cn 



ATGATGATCA TTATGTATCA GCAACTAAAG TTAAGTCTGT 320 

AGATAAATTT AGGGCACATG ATTTAATTTA TAACATTAGT 360 

GATAAAAAAC TGAAAAATTA TGACAAAGTG AAAACAGAGT 400 

TATTAAATGA AGGTTTAGCA AAGAAGTACA AAGATGAAGT 440 

AGTTGATGTG TATGGATCAA ATTACTATGT AAACTGCTAT 480 

TTTTCATCCA AAGATAATGT AGGTAAAGTT ACAGGTGGCA 520 

AAACTTGTAT GTATGGAGGA ATAACAAAAC ATGAAGGAAA 560 

CCACTTTGAT AATGGGAACT TACAAAATGT ACTTATAAGA 600 

GTTTATGAAA ATAAAAGAAA CACAATTTCT TTTGAAGTGC 640 

AAACTGATAA GAAAAGTGTA ACAGCTCAAG AACTAGACAT 680 

AAAAGCTAGG AATTTTTTAA TTAATAAAAA AAATTTGTAT 720 

GAGTTTAACA GTTCACCATA TGAAACAGGA TATATAAAAT 760 



TTATTGAAAA TAACGGCAAT ACTTTTTGGT ATGATATGAT 



800 
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ru 
cn 

C3 



GCCTGCACCA 


GGCGATAAGT 


TTGACCAATC 


TAAATATTTA 


840 


ATGATGTACA 


ACGACAATAA 


AACGGTTGAT 


TCTAAAAGTG 


880 


TGAAGATAGA 


AGTCCACCTT 


ACAACAAAGA 


ATGGATAATG 


920 


TTAATCCGAT 


TTTGATATAA 


AAAGTGAAAG 


TATTAGATAT 


960 


ATTTGAAAGG 


TAAGTACTTC 


GGTGCTTGCC 


TTTTTAGGAT 


1000 


GCATATATAT 


AGATTAAACC 


GCACTTCTAT 


ATTAATAGAA 


1040 


AGTGCGGTTA 


TTTATACACT 


CAATCTAAAC 


TATAATAATT 


1080 


GGAATCATCT 


TCAAA 






1095 



(15) INFORMATION FOR SEQUENCE ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 266 
p (B) TYPE: Amino Acid 

p (C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 



''4 (ii) Molecule type: Peptide 

CQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Asn Lys Ser Arg Phe He Ser Cys Val 
5 10 

He Leu He Phe Ala Leu He Leu Val Leu 
15 20 

Phe Thr Pro Asn Val Leu Ala Glu Ser Gin 
25 30 

Pro Asp Pro Thr Pro Asp Glu Leu His Lys 
35 40 

Ala Ser Lys Phe Thr Gly Leu Met Glu Asn 
45 50 

Met Lys Val Leu Tyr Asp Asp His Tyr Val 
55 60 

Ser Ala Thr Lys Val Lys Ser Val Asp Lys 
65 70 

Phe Arg Ala His Asp Leu He Tyr Asn He 
75 80 
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Ser Asp Lys Lys Leu Lys Asn Tyr Asp Lys 
85 90 

Val Lys Thr Glu Leu Leu Asn Glu Gly Leu 

95 100 

Ala Lys Lys Tyr Lys Asp Glu Val Val Asp 
105 110 

Val Tyr Gly Ser Asn Tyr Tyr Val Asn Cys 

115 120 

Tyr Phe Ser Ser Lys Asp Asn Val Gly Lys 
125 130 

Val Thr Gly Gly Lys Thr Cys Met Tyr Gly 
135 140 

Gly lie Thr Lys His Glu Gly Asn His Phe 
U 145 150 

b 

i==t Asp Asn Gly Asn Leu Gin Asn Val Leu He 

g 155 160 

Arg Val Tyr Glu Asn Lys Arg Asn Thr He 
165 170 

Ser Phe Glu Val Gin Thr Asp Lys Lys Ser 
s 175 180 

i''^ Val Thr Ala Gin Glu Leu Asp He Lys Ala 

rU 185 190 

m 

Q Arg Asn Phe Leu He Asn Lys Lys Asn Leu 

. 195 200 

Tyr Glu Phe Asn Ser Ser Phe Tyr Glu Thr 

205 210 

Gly Tyr He Lys Phe He Glu Asn Asn Gly 
215 220 

Asn Thr Phe Trp Ty^ Asp Met Met Pro Ala 

225 230 

Pro Gly Asp Lys Phe Asp Gin Ser Lys Tyr 
235 240 

Leu Met Met Tyr Asn Asp Asn Lys Thr Val 
245 250 

Asp Ser Lys Ser Val Lys He Glu Val His 
255 260 
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Leu Thr Thr L"^^s Asn Gly 
265 



(16) INFORMATION FOR SEQUENCE ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1837 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCATGTTTGA CAGCTTATCA TCGATAAGCT TACTTTTCGA 40 
j,^ ATCAGGTCTA TCCTTGAAAC AGGTGCAACA TAGATTAGGG 80 

P CATGGAGATT TACCAGACAA CTATGAACGT ATATACTCAC 120 



□ 

ru 



ATCACGCAAT CGGCAATTGA TGACATTGGA ACTAAATTCA 160 

ATCAATTTGT TACTAACAAG CAACTAGATT GACAACTAAT 200 

TCTCAACAAA CGTTAATTTA ACAACATTCA AGTAACTCCC 240 

B 

ACCAGCTCCA TCAATGCTTA CCGTAAGTAA TCATAACTTA 280 

\t 

fy CTAAAACCTT GTTACATCAA GGTTTTTTCT TTTTGTCTTG 320 

C3 TTCATGAGTT ACCATAACTT TCTATATTAT TGACAACTAA 3 60 

ATTGACAACT CTTCAATTAT TTTTCTGTCT ACTCAAAGTT 400 

TTCTTCATTT GATATAGTCT AATTCCACCA TCACTTCTTC 440 

CACTCTCTCT ACCGTCACAA CTTCATCATC TCTCACTTTT 480 

TCGTGTGGTA ACACATAATC AAATATCTTT CCGTTTTTAC 520 

GCACTATCGC TACTGTGTCA CCTAAAATAT ACCCCTTATC 560 

T^TCGCTTCT TTAAACTCAT CTATATATAA CATATTTCAT 600 

CCTCCTACCT ATCTATTCGT AAAAAGATAA AAATAACTAT 640 

TGTTTTTTTT GTTATTTTAT AATAAAATTA TTAATATAAG 680 

TTAATGTTTT TTAAAAATAT ACAATTTTAT TCTATTTATA 720 
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□ 

C3 
C3 

ru 

m 
■P 

3 

u 
fU 
CP 
Q 



GTTAGCTATT 


TTTTCATTGT 


TAGTAATATT 


GGTGAATTGT 


760 


AATAACCTTT 


TTAAATCTAG 


AGGAGAACCC 


AGATATAAAA 


800 


TGGAGGAATA 


TTAATGGAAA 


ACAATAAAAA 


AGTATTGAAG 


840 


AAAATGGTAT 


TTTTTGTTTT 


AGTGACATTT 


CTTGGACTAA 


880 


CAATCTCGCA AGAGGTATTT GCTCAACAAG 


ACCCCGATCC 


920 


AAGCCAACTT 


CACAGATCTA 


GTTTAGTTAA 


AAACCTTCAA 


960 


AATATATATT 


TTCTTTATGA 


GGGTGACCCT 


GTTACTCACG 


1000 


AGAATGTGAA 


ATCTGTTGAT 


CAACTTAGAT 


CTCACGATTT 


1040 


AATATATAAT 


GTTTCAGGGC 


CAAATTATGA 


TAAATTAAAA 


1080 


ACTGAACTTA 


AGAACCAAGA 


GATGGCAACT 


TTATTTAAGG 


1120 


ATAAAAACGT 


TGATATTTAT 


GGTGTAGAAT 


ATTACCATCT 


1160 


CTGTTATTTA 


TGTGAAAATG 


CAGAAAGGAG 


TGCATGTATC 


1200 


TACGGAGGGG 


TAACAAATCA 


TGAAGGGAAT 


CATTTAGAAA 


1240 


TTCCTAAAAA 


GATAGTCGTT 


AAAGTATCAA 


TCGATGGTAT 


1280 


CCAAAGCCTA 


TCATTTGATA 


TTGAAACAAA 


TAAAAAAATG 


1320 


GTAACTGCTC 


AAGAATTAGA 


CTATAAAGTT 


AGAAAATATC 


1360 


TTACAGATAA 


TAAGCAACTA 


TATACTAATG 


GACCTTCTAA 


1400 


ATATGAAACT 


GGATATATAA 


AGTTCATACC 


TAAGAATAAA 


1440 


GAAAGTTTTT 


GGTTTGATTT 


TTTCCCTGAA 


CCAGAATTTA 


1480 


CTCAATCTAA 


ATATCTTATG 


ATATATAAAG 


ATAATGAAAC 


1520 


GCTTGACTCA 


AACACAAGCC 


AAATTGAAGT 


CTACCTAACA 


1560 


ACCAAGTAAC 


TTTTTGCTTT 


TGGCAACCTT 


ACCTACTGCT 


1600 


GGATTTAGAA 


ATTTTATTGC 


AATTCTTTTA 


TTAATGTAAA 


1640 


AACCGCTCAT 


TTGATGAGCG 


GTTTTGTCTT 


ATCTAAAGGA 


1680 


GCTTTACCTC 


CTAATGCTGC 


AAAATTTTAA 


ATGTTGGATT 


1720 


TTTGTATTTG 


TCTATTGTAT 


TTGATGGGTA 


ATCCCATTTT 


1760 


TCGACAGACA 


TCGTCGTGCC 


ACCTCTT^CA 


CCAAAATCAT 


1800 
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AGACAGGAGC TTGTAGCTTA GCAACTATTT TATCGTC 1837 



(17) INFORMATION FOR SEQUENCE ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Glu Asn Asn Lys Lys Val Leu Lys Lys 
5 10 

Met Val Phe Phe Val Leu Val Thr Phe Leu 
15 20 

Gly Leu Thr lie Ser Gin Glu Val Phe Ala 
D 25 30 

□ 

C3 Gin Gin Asp Pro Asp Pro Ser Gin Leu His 

rU 35 40 

fO Arg Ser Ser Leu Val Lys Asn Leu Gin Asn 

li 45 50 



ru 
m 
a 



lie Tyr Phe Leu Tyr Glu Gly Asp Pro Val 
55 60 

Thr His Glu Asn Val Lys Ser Val Asp Gin 
65 70 

Leu Arg Ser His Asp Leu lie Tyr Asn Val 

75 80 

Ser Gly Pro Asn Tyr Asp Lys Leu Lys Thr 
85 90 

Glu Leu Lys Asn Gin Glu Met Ala Thr Leu 

95 100 

Phe Lys Asp Lys Asn Val Asp lie Tyr Gly 
105 110 

Val Glu Tyr Tyr His Leu Cys Tyr Leu Cys 
115 120 

Glu Asn Ala Glu Arg Ser Ala Cys He Tyr 
125 130 

Gly Gly Val Thr Asn His Glu Gly Asn His 
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135 140 

Leu Glu lie Pro Lys Lys lie Val Val Lys 

145 150 

Val Ser lie Asp Gly lie Gin Ser Leu Ser 
155 160 

Phe Asp lie Glu Thr Asn Lys Lys Met Val 

165 170 

Thr Ala Gin Glu Leu Asp Tyr Lys Val Arg 
175 180 

Lys Tyr Leu Thr Asp Asn Lys Gin Leu Tyr 
185 190 

Thr Asn Gly Pro Ser Lys Tyr Glu Thr Gly 
195 200 

It. Tyr He Lys Phe He Pro Lys Asn Lys Glu 

P 205 210 

Q 

Q Ser Phe Trp Phe Asp Phe Phe Pro Glu Pro 

rU 215 220 

CO Glu Phe Thr Gin Ser Lys Tyr Leu Met He 

«p 225 230 

3 

U Tyr Lys Asp Asn Glu Thr Leu Asp Ser Asn 

235 240 



ru 

Q 



Thr Ser Gin He Glu Val Tyr Leu Thr Thr 
245 250 

Lys 



SEQ ID NO: 17 

sense primer for cloning SpeA 

5' CTCG CAA GAG GTA CAT ATG CAA CAA GAC 3; (SEQ ID NO: ) , 
sense primer to introduce a unique Ndel site; 

SEQ ID NO: 18 

antisense primer for cloning SpeA 

5' GCA GTA GGT AAG CTT GCC AAA AGC 3' (SEQ ID NO: ) 



SEQ ID NO: 19 

1. SpeA forward primer , including Ndel site: 
5' GATATACATATGCAACAAGACCCCGATCCAAGCC 3' 34-mer 
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SEQIDNO:20 

SpeA reverse primer , adds SpeB overlap 

5' GAGATTTAACAACTGGTTGCTTGGTTGTTAGGTAGAC 3' 37-iner 
SEQ ID NO: 21 

3. SpeB forward primer adds SpeA overlap: 
5 ' GTCTACCTAACAACCAAGC AACCAGTTGTTAAATCTC 3 ' 37-mer 
SEQ ID NO: 22 

4. SpeB reverse orimer : adds stop site (Amber) and maintains BamHI site: 
5' GAATTCGGATCCGCTAGCCTACAACAG 3' 27-mer 

SEQ ID NO: 23 

SpeA (L42R) -SpeB (C47S) gene insert DNA sequence 
1 

M atgcaacaag accccgatcc aagccaactt cacagatcta gtttagttaa 

a 51 

C3 aaaccttcaa aatatatatt ttctttatga gggtgaccct gttactcacg 

C3 101 

ry agaatgtgaa atctgttgat caacttcgat ctcacgattt aatatataat 

fu gtttcagggc caaattatga taaattaaaa actgaactta agaaccaaga 

Ip 201 

g gatggcaact ttatttaagg ataaaaacat tgatatttat ggtgtagaat 

M 251 

«^ attaccatct ctgttattta tgtgaaaatg cagaaaggag tgcatgtatc 

301 

tacggagggg taacaaatcg tgaagggaat catttagaaa ttcctaaaaa 
351 

— gatagtcgtc aaagtatcaa tcgatggtat acaaagccta tcatttgata 

401 

ttgaaacaaa taaaaaaatg gtaactgctc aagaattaga ctataaagtt 
451 

agaaaatatc ttacagataa taagcaacta tatactaatg gaccttctaa 
501 

atatgaaact ggatatataa agttcatacc taagaataaa gaaagttttt 
551 

ggtttgattt tttccctgaa ccagaattta ctcaatctaa atatcttatg 
601 

atatataaag ataatgaaac gcttgactca aacacaagcc aaattgaagt 
651 

ctacctaaca accaagcaac cagttgttaa atctctcctt gattcaaaag 
701 

gcattcatta caatcaaggt aacccttaca acctattgac acctgttatt 
751 

gaaaaagtaa aaccaggtga acaatctttt gtaggtcaac atgcagctac 
801 

aggatgtgtt gctactgcaa ctgctcaaat tatgaaatat cataattacc 
851 



ru 

m 
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ctaacaaagg gttgaaagac tacacttaca cactaagctc aaataaccca 
901 

tatttcaacc atcctaagaa cttgtttgca gctatctcta ctagacaata 
951 

caactggaac aacatcctac ctacttatag cggaagagaa tctaacgttc 
1001 

aaaaaatggc gatttcagaa ttgatggctg atgttggtat ttcagtagac 
1051 

atggattatg gtccatctag tggttctgca ggtagctctc gtgttcaaag 
1101 

agccttgaaa gaaaactttg gctacaacca atctgttcac caaatcaacc 
1151 

gtagcgactt tagcaaacaa gattgggaag cacaaattga caaagaatta 

1201 

tctcaaaacc aaccagtata ctaccaaggt gtcggtaaag taggcggaca 
1251 

tgcctttgtt atcgatggtg ctgacggacg taacttctac catgttaact 
1301 

ggggttgggg tggagtctct gacggcttct tccgtcttga cgcactaaac 
1351 

ccttcagctc ttggtactgg tggcggcgca ggcggcttca acggttacca 
P 1401 

M aagtgctgtt gtaggctag 

C3 

ru 

SEQ ID NO: 24 

p Full-length SpeB polypeptide (Kagawa et al . , PNAS 97:2235-2240. 2000): 

«P 1 MNKKKLGIRL LSLLALGGFV LANPVFADQN FARNEKEAKD SAITFIQKSA AIKAGARSAE 

^ 61 DIKLDKVNLG GELSGSNMYV YNISTGGFVI VSGDKRSPEI LGYSTSGSFD ANGKENIASF 

l« 121 MESYVEQIKE NKKLDTTYAG TAEIKQPWK SLLDSKGIHY NQGNPYNLLT PVIEKVKPGE 

181 QSFVGQHAAT GCVATATAQI MKYHNYPNKG LKDYTYTLSS NNPYFNHPKN LFAAISTRQY 

m 241 NWNNILPTYS GRESNVQKMA ISELMADVGI SVDMDYGPSS GSAGSSRVQR ALKENFGYNQ 

j.« 3 01 SVHQINRGDF SKQDWEAQID KELSQNQPVY YQGVGKVGGH AFVIDGADGR NFYHVNWGWG 

|« 361 GVSDGFFRLD ALNPSALGTG GGAGGFNGYQ SAWGIKP 
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SEQ ID NO: 25 

SpeB clone used for fusion, mature polypeptide. Estimated M,,= 28.75 kDa 

1 QPWKSLLDS KGIHYNQGNP YNLLTPVIEK VKPGEQSFVG QHAATGCVAT 

51 ATAQIMKYHN YPNKGLKDYT YTLSSNNPYF NHPKNLFAAI STRQYNWNNI 

101 LPTYSGRESN VQKMAISELM ADVGISVDMD YGPSSGSAGS SRVQRALKEN 

151 FGYNQSVHQI NRSDFSQDWE AQIDKELSQN QPVYYQGGKV GGHAFVIDGA 

201 DGRNFYHVNW GWGGVSDGFF RLDALNPSAL GTGGGAGGFN GYQSAWG 

SEQ ID NO: 26 

L42R SpeA mutant clone used for fusion. Estimated U^= 2 5.2 kDa 

1 MQQDPDPSQL HRSSLVKNLQ NIYFLYEGDP VTHENVKSVD QLRSHDLIYN 

51 VSGPNYDKLK TELKNQEMAT LFKDKNIDIY GVEYYHLCYL CENAERSACI 

101 GGVTNREGNH LEIPKKIWK VSIDGIQSLS FDIETNKKMV TAQELDYKVR 

151 KYLTDNKQLY TNGPSKYETG YIKFIPKNKE SFWFDFFPEP EFTQSKYLMI 
2 01 YKDNETLDSN TQIEVYLTTK 



SEQ ID NO: 27 

SpeA [L42R]-SpeB [C47S3 fusion. 



Estimated 54 kDa 



1^ 

m 



1 MQQDPDPSQL 

51 VSGPNYDKLK 

101 GGVTNREGNH 

151 KYLTDNKQLY 

201 YKDNETLDSN 

251 VKPGEQSFVG 

301 NHPKNLFAAI 

351 YGPSSGSAGS 

401 QPVYYQGGKV 

551 GTGGGAGGFN 



HRSSLVKNLQ 
TELKNQEMAT 
LEIPKKIWK 
TNGPSKYETG 
TQIEVYLTTK 
QHAATGCVAT 
STRQYNWNNI 
SRVQRALKEN 
GGHAFVIDGA 
GYQSAWG 



NIYFLYEGDP 
LFKDKNIDIY 
VSIDGIQSLS 
YIKFIPKNKE 
QPWKSLLDS 
ATAQIMKYHN 
LPTYSGRESN 
FGYNQSVHQI 
DGRNFYHVNW 



VTHENVKSVD 

GVEYYHLCYL 
FDIETNKKMV 
SFWFDFFPEP 
KGIHYNQGNP 
YPNKGLKDYT 
VQKMAISELM 
NRSDFSQDWE 
GWGGVSDGFF 



QLRSHDLIYN 

CENAERSACI 
TAQELDYKVR 
EFTQSKYLMI 
YNLLTPVIEK 
YTLSSNNPYF 
ADVGISVDMD 
AQIDKELSQN 
RLDALNPSAL 



ru 

Cn 
□ 



SEQ ID NO: 28 

1, SpeA forward primer, including Ndel site: 

5' GATATACATATGCAACAAGACCCCGATCCAAGCC 3' 34-mer 

SEQ ID NO: 29 

2. SpeA reverse primer : kills Ndel site, adds SpeB overlap: 



5 ' CATGTGTATATCTCCTTCCTTGGTTGTTAGGTAGAC 3 ' 36 -mer 
SEQ ID NO: 30 

3. SpeB forward primer : kills Ndel site, adds SpeA overlap: 

5' GTCTACCTAACAACCAAGGAAGGAGATATACACATG 3' 3 6 -mer 



SEQ ID NO: 31 

4. SpeB reverse primer : adds stop site (Amber) and maintains BamHI site: 

5' GAATTCGGATCCGCTAGCCTACAACAG 3' 2 7 -mer 



